CarlosGG AI notes
GenAI for audio
See GenAI
Resources
Open ASR Leaderboard - a Hugging Face Space by hf-audio
Code
#CODE Whisper
#CODE CrisperWhisper
References
#PAPER Robust Speech Recognition via Large-Scale Weak Supervision (Radford 2022)
#PAPER MusicLM: Generating Music From Text (Agostinelli 2023)
https://google-research.github.io/seanet/musiclm/examples/